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Abstract. In a software watermarking environment, several grapli the- 
oretic watermark methods use numbers as watermark values, where some 
of these methods encode the watermark numbers as graph structures. In 
this paper we extended the class of error correcting graphs by proposing 
an efficient and easily implemented codec system for encoding water- 
mark numbers as reducible permutation flow-graphs. More precisely, we 
first present an efficient algorithm which encodes a watermark number w 
as self-inverting permutation tt* and, then, an algorithm which encodes 
the self-inverting permutation tt* as a reducible permutation flow-graph 
-F[7r*] by exploiting domination relations on the elements of tt* and us- 
ing an efficient DAG representation of tt*. The whole encoding process 
takes 0(n) time and space, where n is the binary size of the number 
w or, equivalently, the number of elements of the permutation tt*. We 
also propose efficient decoding algorithms which extract the number w 
from the reducible permutation flow-graph F[t:*] within the same time 
and space complexity. The two main components of our proposed codec 
system, i.e., the self-inverting permutation vr* and the reducible permu- 
tation graph -F[7i"*], incorporate important structural properties which 
make our system resilient to attacks. 

1 Introduction 

Software watermarking is a technique that is currently being studied to prevent 
or discourage software piracy and copyright infringement. The idea is similar 
to digital (or, media) watermarking where a unique identifier is embedded in 
image, audio, or video data through the introduction of errors not detectable 
by human perception |14| . The software watermarking problem can be described 
as the problem of embedding a structure w into a program P such that w 
can be reliably located and extracted from P even after P has been subjected 
to code transformations such as translation, optimization and obfuscation |23| . 
More precisely, given a program P, a watermark w, and a key k, the software 
watermarking problem can be formally described by the following two functions: 
embed(P, It;, fc) — P' and extract (P', fc) w. 

Although digital watermarking has made considerable progress and become a 
popular technique for copyright protection of multimedia information |14I30| , re- 
search on software watermarking has recently received sufficient attention. The 
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patent by Davidson and Myhrvold [15] presented the first published software 
watermarking algorithm. The preliminary concepts of software watermarking 
also appeared in paper fT8^ and patents |21|28j . Collberg et al. [9|10j presented 
detailed definitions for software watermarking. Authors of papers [32133] have 
given brief surveys of software watermarking research (see, Collberg and Nagra 
[6] for an exposition of the main results). 

Static and Dynamic Watermarking Algorithms: There are two general 
categories of watermarking algorithms namely static and the dynamic algorithms 
[9]. A static watermark is stored inside program code in a certain format, and it 
does not change during the program execution. A dynamic watermark is built 
during program execution, perhaps only after a particular sequence of input. 
It might be retrieved by analyzing the data structures built when watermarked 
program is running. In other cases, tracing the program execution may be re- 
quired. Further discussion of static and/or dynamic watermarking issues can be 
found in |15l21l31j . 

Algorithms and Techniques for Software Watermarking: A lot of research 
has been done on software watermarking. The major software watermarking 
algorithms currently available are based on several techniques, among which the 
register allocation, spread-spectrum, opaque predicate, abstract interpretation, 
dynamic path techniques (see, [2 5 12 13 22 24 26 29 ). 

Recently, several software watermarking algorithms have been appeared in 
the literature that encode watermarks as graph structures. In general, such en- 
codings make use of an encoding function encode which converts a watermark- 
ing number w into a graph G, encode{w) — > G, and also of a decoding function 
decode that converts the graph G into the number w, decode(G) w; we usu- 
ally call the pair (encode, decode) as graph codec 0. From a graph-theoretic 
point of view, we are looking for a class of graphs Q and a corresponding codec 
(encode, decode)g; with the following properties: 

• Appropriate Graph Types: Graphs in Q should be directed having such proper- 
ties, i.e., nodes with small outdegree, so that matching real program graphs; 

• High Resiliency: The function decode{G) should be insensitive to small changes 
of G, i.e., insertions or deletions of a constant number of nodes or/and edges; 
that is, ii G £ G and decode{G) w then decode{G') w with G' « G; 

• Small Size: The size \Pw \ — \P\ of the embedded watermark should be small; 

• Efficient Codecs: The functions encode and decode should be computed in 
polynomial time. 

In 1996, Davidson and Myhrvold [15] proposed the first software watermark- 
ing algorithm which is static and embeds the watermark by reordering the ba- 
sic blocks of a control flow-graph. Based on this idea, Venkatesan, Vazirani and 
Sinha |31| proposed the first graph-based software watermarking algorithm which 
embeds the watermark by extending a method's control flow-graph through the 
insertion of a directed subgraph; it is a static algorithm and is called VVS or GTW. 
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In [3T] the construction of a directed graph G (or, watermark graph G) is not 
discussed. Collberg et aL [8] proposed an implementation of GTW, which they cah 
GTWsm, and it is the first pubhcly available implementation of the algorithm GTW. 
In GTWsm the watermark is encoded as a reducible permutation graph (RPG) 
[7], which is a reducible control flow-graph with maximum out-degree of two, 
mimicking real code. Note that, for encoding integers the GTWsm method uses 
only those permutations that are self-inverting. The first dynamic watermarking 
algorithm (CT) was proposed by Collberg and Thomborson [9]; it embeds the 
watermark through a graph structure which is built on a heap at runtime. 

Attacks: A successful attack against the watermarked program prevents 
the recognizer from extracting the watermark while not seriously harming the 
performances or correctness of the program ■ It is generally assumed that the 
attacker has access to the algorithm used by the embedder and recognizer. There 
are four main ways to attack a watermark in a software. 

• Additive attacks; Embed a new watermark into the watermarked software, so 
the original copyright owners of the software cannot prove their ownership 
by their original watermark inserted in the software; 

• Subtractive attacks: Remove the watermark of the watermarked software 
without affecting the functionality of the watermarked software; 

• Distortive attacks: Modify watermark to prevent it from being extracted by 
the copyright owners and still keep the usability of the software; 

• Recognition attacks: Modify or disable the watermark detector, or its inputs, 
so that it gives a misleading result. For example, an adversary may assert that 
"his" watermark detector is the one that should be used to prove ownership 
in a courtroom test. 

Attacks against graph-based software watermarking algorithms can mainly 
occur in the following three ways: (i) Edge-flip attacks, (ii) Edges-addition/deletion 
attacks, and (iii) Node-addition/deletion attacks. 

Our Contribution: In this paper we present an efficient and easily implemented 
algorithm for encoding numbers as reducible permutation flow-graphs through 
the use of self- inverting permutations (or, for short, SIP). 

More precisely, we first present an efficient algorithm which encodes a num- 
ber (integer) w as self- inverting permutation tt*. Our algorithm, which we call 
Encode_W-to-SIP, takes as input an integer w, computes first its binary repre- 
sentation bib2 • ■ • bn, then constructs a bitonic permutation on 2n -I- 1 numbers, 
and finally produces a self-inverting permutation tt* of length 2n+l in 0{n) time 
and space. We also present a decode algorithm which extracts the integer w from 
the self-inverting permutation tt* within the same time and space complexity; 
we call the decode algorithm Decode_SIP-to-W. 

Having designed an efficient method for encoding integers as self-inverting 
permutations, we next describe an algorithm for encoding a self-inverting permu- 
tation into a directed graph structure having properties capable to match real 
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program graphs. In particular, we propose the algorithm Encode_SIP-to-RPG 
which encodes the self-inverting permutation tt* as a reducible permutation flow- 
graph F[Tr*] by exploiting domination relations on the elements of tt* and using 
an efficient DAG representation of tt*. The whole encoding process takes 0{n) 
time and requires 0{n) space, where n is the length of the permutation tt*. 
We also propose an efficient and easily implemented algorithm, the algorithm 
DecodeJlPG-to-SIP, which extract the self-inverting permutation tt* from the 
reducible permutation flow-graph F[tt*] by converting first the graph F[k*] into 
a directed tree T[7r*] and then applying DFS-search on T[7r*]. The decoding pro- 
cess takes time and space linear in the size of the flow-graph i^[7r*], that is, the 
algorithm Decode_RPG-to-SIP takes 0{n) time and space. We point out that 
the only operations used by the decoding algorithm are edge modifications on 
F[tt*] and DFS-search on trees. 

It is worth noting that our codec (encode, decode)i?[^.] system incorporates 
several important properties which characterize it as an efficient and easily im- 
plemented software watermarking component. In particular, the reducible per- 
mutation flow-graph F[7r*] does not difli'er from the graph data structures built 
by real programs since its maximum outdegree does not exceed two and it has 
a unique root node so the program can reach other nodes from the root node. 
The function Decode_RPG-to-SIP is high insensitive to small edge-changes and 
quite insensitive to small node-changes of -F[7r*], and the graph F[tt*] unable us 
to correct such edge changes. Moreover, the self-inverting permutation tt* cap- 
tures important structural properties, due to the bitonic property used in the 
construction of tt*, which make our codec system resilient to attacks. 

Finally, we point out that our codec (encode, decode)i?[^.] system has very 
low time and space complexity which is 0{n) where n is the number of bits 
in a binary representation of the watermark integer w. Indeed, both functions 
Encode_W-to-SIP and Decode_SIP-to-W are computed in time and space lin- 
ear in the binary size of the watermark integer w. Moreover, the functions 
Encode_SIP-to-RPG and DecodeJlPG-to-SIP are also computed in linear time 
and space; in particular, the function Encode_SIP-to-RPG is computed in time 
and space linear in the length of the self-inverting permutation n* which is 0{n), 
while the function Decode_RPG-to-SIP is computed in time and space linear in 
the size of the flow-graph f [tt*] which is also 0{n). 



2 Preliminciries 

We consider finite graphs with no multiple edges. For a graph G, we denote by 
V{G) and E{G) the vertex set and edge set of G, respectively. The neighbor- 
hood N{x) of a vertex x of the graph G is the set of all the vertices of G which 
are adjacent to x. The degree of a vertex x in the graph G, denoted deg(x), 
is the number of edges incident on x; thus, d{x) = \N(x)\. For a node x of a 
directed graph G, the number of head-endpoints of the directed edges adjacent 
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to X is called the indegree of the node denoted indeg{x), and the number of 
tail-endpoints is its outdegree, denoted outdeg{x). 

A path in a graph G of length /c is a sequence of vertices (?;o, f i, . . . , ffe) such 
that (wi-i, Wi) G E{G) for i = 1, 2, . . . , fc. A path is called simple if none of its 
vertices occurs more than once. A path (simple path) (wq, wi, . . . , Ufc) is a cycle 
[simple cycle) of length fc + 1 if (wojWfc) G E(G). 

Next, we introduce some definitions that are key-objects in our algorithms 
for encoding numbers as graphs. Let tt be a permutation over the set Nn = 
{1, 2, . . . , n}. We think of permutation tt as a sequence (tti, 7r2, . . . , 7r„), so, for 
example, the permutation tt = (1,4,2,7,5,3,6) has tti = I, 1:2 — 4, ect. No- 
tice that TTj^^ is the position in the sequence of the number i; in our example, 
TTj^ = 2, vrfi = 4, TTg^i = 6, ect [T7] . 

Definition 1: The inverse of a permutation (tti, 7r2, . . . , 7r„) is the permutation 
{qi,q2, ■ . ■ , Qn) with g^ri = '"'gi = j. A self-inverting permutation (or, involution) 
is a permutation that is its own inverse: tTt^. — i. 

By definition, every permutation has a unique inverse, and the inverse of the 
inverse is the original permutation. Clearly, a permutation is a self-inverting per- 
mutation if and only if all its cycles are of length 1 or 2; hereafter, we shall denote 
a 2-cycle as c = (x, y) and an 1-cycle as c = (x), or, equivalently, c = (x, x). 

Definition 2: Let Ci,2 = {ci = {xi,yi),C2 = (2:2, 2/2), • • • , Cfc = (xfc,2/fc)} be 
the set of all the cycles of a self-inverting permutation tt such that Xi < yi 
(1 < i < fc), and let -< be a linear order on Ci^2 such that Ci -< Cj if x.^ < Xj, 
1 < ijj < fc- A sequence C = (ci,C2, . . . ,Cfc) of all the cycles of a self-inverting 
permutation vr is called increasing cycle representation of tt if ci C2 ^ • • • -< Cfc. 
The cycle ci is the minimum element of the sequence C. 

Let TT be a permutation on N = {1, 2, . . . , n}. We say that an element i of 
the permutation vr dominates the element j if « > j and tt~^ < t^J^- An element 
i directly dominates (or, for short, didominates) the element j if i dominates j 
and there exists no element fc in tt such that i dominates fc and fc dominates j 
pS] . For example, in the permutation tt = (8,3,2,7,1,9,6,5,4), the element 7 
dominates the elements 1,6,5,4 and it directly dominates the elements 1,6. 

Definition 3: The domination (resp. didomination) set dom(i) (resp. didom(i)) 
of the element i of a permutation tt is the set of all the elements of tt that dom- 
inate (resp. didominate) the element i. 

Definition 4: An undirected graph G with vertices numbered from 1 to n; 
that is, V{G) ~ {l,2,...,n}, is called a permutation graph if there exists a 
permutation tt = (7ri,7r2, . . . ,7r„) on iV„ such that, {i,i) G E{G) if and only if 

(i-j)(^r'-^7') <o. 
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A flow-graph is a directed graph F with an initial node s from which ah other 
nodes are reachable. A directed graph G is strongly connected when there is a 
path X ^ y for all nodes x, y in V{G). A node u is an entry for a subgraph H 
of the graph G when there is a path p — (yi, ?/2, ■ • • , 2/fcj a^) such that pC\H — {x}. 

Definition 5: A flow-graph is reducible when it does not have a strongly con- 
nected subgraph with two (or more) entries. 

3 Encode Watermark Numbers as Self-inverting 
Permutations 

In this section, we first introduce the notion of Bitonic Permutations and then 
we present two algorithms, namely Encode_W-to-SIP and Decode_SIP-to-W, for 
encoding an integer w into an self-inverting permutation tt* and extracting it 
from TT*. Both algorithms run in 0{n) time, where n is the length of the binary 
representation of the integer w [4]. 

3.1 Bitonic Permutations 

The key-object in our algorithm for encoding integers as self-inverting permu- 
tations is the bitonic permutation: a permutation tt = (7ri,7r2, . . . ,7r„) over the 
set Nn is called bitonic if either monotonically increases and then monotonically 
decreases, or else monotonically decreases and then monotonically increases. For 
example, the permutations tti = (1, 4, 6, 7, 5, 3, 2) and 7r2 = (6, 4, 3, 1, 2, 5, 7) are 
both bitonic [3]. 

In this paper, we consider only bitonic permutations that monotonically in- 
creases and then monotonically decreases. Let tt = (tti, 7r2, . . . , Tr.i, Tr.i+i, . . . , 7r„) 
be such a bitonic permutation over the set Nn and let tt^, tt^+i be the two consecu- 
tive elements of tt such that tt^ > 7r,i+i . Then, the sequence X = (tti , 7r2, . . . , tTj) is 
called first increasing subsequence of tt and the sequence Y = (Tr^+i , 7r,i+2 , . . . , 7r„) 
is called first decreasing subsequence of tt. 

We next give some notations and terminology we shall use throughout the 
paper. Let w be an integer number. We denote hy B = bib2 ■ ■ ■ bn the binary rep- 
resentation of w. If Bi = 6162 ■ ■ - bn and B2 = did2 ■ ■ • dm be two binary numbers, 
then the number i?i||_B2 is the binary number 6162 ■ ■ • 6„c?id2 • ■ • dm- The binary 
sequence of the number B — 6162 • • • &n is the sequence B* = (61, 62, ■ • ■ , bn) of 
length n. 

Let B — 6162 ■ ■ - bn he a binary number. Then, flip{B) = 6^62 ■ ■ - b'n is the 
binary number such that b'^ = (1 resp.) if and only if 6^ = 1 (0 resp.), 1 < i < n. 

3.2 Algorithm Encode_W-to-SIP 

In this section, we present an algorithm for encoding an integer as self-inverting 
permutation. In particular, our algorithm takes as input an integer w, computes 
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the binary representation 6162 •••6n of w, and then produces a self-inverting 
permutation w* in 0{n) time. We next describe the proposed algorithm: 

Algorithm Encode_W-to-SIP 

1. Compute the binary representation B = 6162 • ■ • &n of w; 

2 . Construct the binary number i?' = 00---0|[i?||lof length 2n + l, and then 
the binary sequence B* = (&i, 62, • • • , &n') of flip{B'); 

3. Find the sequence X = (xi,X2, ■ ■ ■ , Xk) of the O's positions and the sequence 
Y = (?/i,y2, • • • ,ym) of the I's positions in B* from left-to-right; 

4. Construct the bitonic permutation tt^ = X\\Y^ on n' = 2n -|- 1 numbers; 

let TT^ = (xi,X2, . . . ,Xk,ym,Vm-l, ■ • ■ 

5. Set (7ri,7r2, . . . ,7rfe,7rfc+i,7rfe+2, • • • ,7r„') = {xi,X2, ■ ■ ■ ,Xk,ym,ym-i, ■ ■ ■ lyi), 
i = 1 and j = n!\ 

while j < j do the following: 

construct the 2-cycle Cj = (tTj, tTj), and set i = « + 1 and j = j — 1; 
end-while; 

if i = J then construct the 1-cycle Cj = (tTj); 

6. Construct the permutation tt* = (771,772, .. . ,7r„/) on n' = 2n + 1 numbers 
such that TTi = i, 1 < i < n'; 

7. Let C be the set of all cycles computed at step 5; 

for each 2-cyclc (77^,77^) S C set 77,^, = tTj and tTttj = T^i\ 

8. Return the self-inverting permutation tt*; 

Exeimple 1: Let w = 12 be the input watermark integer in the algorithm 
Encod.e_W-to-SIP. We first compute the binary representation B = 1100 of the 
number 12; then we construct the binary number B' = 000011001 and the bi- 
nary sequence B* = (1, 1, 1, 1, 0, 0, 1, 1, 0) of flip{B'); we compute the sequences 
X = (5,6,9) and Y = (1,2,3,4,7,8), and then construct the bitonic permuta- 
tion TT = (5,6,9,8,7,4,3,2,1) on n' = 9 numbers; since n' = 9 odd, we select 
4 pairs (5, 1), (6,2), (9,3), (8,4) and the number 7 and then construct the self- 
inverting permutation it* = (5, 6, 9, 8, 1, 2, 7, 4, 3). 

Time and Space Complexity. The encoding algorithm Encode_W-to-SIP performs 
basic operations on sequences of lengths 0{n), where n is the number of bits in 
the binary representation of w (sec Figure 1); hereafter, for the number n we 
shall call the term binary size of the integer w. Moreover, all the operations 
are executed in place, i.e., the algorithm uses no additional space except of a 
constant number of variables. It is easy to see that the whole encoding process 
requires 0{n) time and space. Thus, the following theorem holds: 

Theorem 1. Let w be an integer and let 6162 • • ■ 6n be the binary representation 
ofw. The algorithm Encode_W-to-SIP encodes the number w in a self-inverting 
permutation tt* of length 2n -|- 1 in 0{n) time and space. 
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3.3 Algorithm Decode_SIP-to-W 

Next, we present an extraction algorithm, that is, an algorithm for decoding a 
self-inverting permutation. More precisely, our extraction algorithm, which we 
call Dccodc_SIP-to-W, takes as input a self-inverting permutation tt* produced 
by Algorithm Encode_W-to-SIP and returns its corresponding integer w. The 
time complexity of the decode algorithm is also 0{n), where n is the length of 
the permutation tt* . We next describe the proposed algorithm: 

Algorithm Decode_SIP-to-W 

1. Compute the increasing cycle representation C = (ci, C2, . . . , Cfc) of the self- 
inverting permutation tt* = (tti, 7r2, . . . , 7r„'), where n' = 2n + 1, that is, 

ci -< C2 ^ ■ ■ ■ ^ Ck; 

2. Set i = 1 and j = n': 

3. Construct the permutation of length n' as follows: 

while the set C is not empty, do the following: 

Select the minimum clement c of the sequence C; 

Case 1: the selected cycle c has length 2 and let c = (a, b): 

TTi = b and Wj = a; 

i = i + 1 and J = J — 1; 
Case 2: the selected cycle c has length 1 and let c = (a): 

TTi = a and i = i + 
Remove the cycle c from C; 

4. Find the first increasing subsequence X = (tti, 7r2, . . . , TTfc) and then the 
decreasing subsequence Y ~ {nk+i, • • • , T^k') of tt; 

5. Construct the binary sequence B* = (bi, 62, . . . , 6„') as follows: 

set O's in positions 7ri,7r2, ■ • • ,7rfc and I's in positions 7rfe+i,7rfe+2, . . . ,77^/; 

6. Compute B' = flip{B*) ^ (&i, &2, • • • , &n, • • • , 

7. Return the integer w of the binary number B = bn+ibn+2 ■ ■ ■ bn'-V, 

Example 2: Let tt* = (5, 6, 9, 8, 1, 2, 7, 4, 3) be a self-inverting permutation pro- 
duced by the algorithm Encode_W-to-SIP. The cycle representation of tt* is the 
following: (1,5), (2, 6), (3, 9), (4, 8), (7); from the cycles we construct the permu- 
tation TT = (5, 6, 9, 8, 7, 4, 3, 2, 1); then, we compute first increasing subsequence 
X = (5, 6, 9) and the first decreasing subsequence Y = (8, 7, 4, 3, 2, 1); we then 
construct the binary sequence B* = (1,1,1,1,0,0,1,1,0) of length 9; we flip 
the elements of B* and construct the sequence B' = (0,0,0,0,1,1,0,0,1); the 
binary number 1100 is the integer w = 12. 

Time and Space Complexity. It is easy to see that the decoding algorithm 
Decode_SIP-to-W performs the same basic operations on sequences of lengths 
0{n) as the encoding algorithm (see Figure 1). Thus, we obtain the following 
result: 
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w. watermarking integer 
A 



B: binary representation 

6162 ■ • ■ f'ji of the integer w 

B': binary number 

Kb',---b'„\\hb2---h„\\h'; 

B*: sequence of Os and Is of flip{B') 
{b[,b',,...,b'„,h„b„...b„,b'l) 

i 

X: sequence of indeces of Os in B* 
Y: sequence of indeces of Is in B* 

i 

tt'': bitonic permutation constructed 
hy X andy^, i.e., X\\Y'^ 

1 

C: set of 2-cycles and 1-cycle 
constructed from it'' 



B: binary number of the 2nd block of B' 
bib2 ■ ■ ■ fe„ = Ki+ib'n+-2 ■ ■ ■ K'-l 

B': binary mmiber of sequence flip(B*) 

B*: binary sequence witfi Os (Is) at the 

positions corresp. to X's (l"s) elements 

t 

X: first increasing subsequence of it'' 
Y: remaining subsequence of tt'' 

\ 

tt'': bitonic permutation constructed from 
the increasing cycle representation 

t 

C: increasing cycle representation of tt* 



V 

TT*: self-inverting permutation 



Fig. 1. The main data components used by the algorithms Encode_W-to-SIP and 
Decode_SIP-to-W 



Theorem 2. Let n* be a self-inverting permutation of length n which encodes an 
integer w using the algorithm Encode J;l-to-SIP. The algorithm Decode^IP-to-V 
correctly decodes the permutation n* in 0{n) time and space. 



4 Encode Self-inverting Permutations as Reducible 
Permutation Graphs 



Having proposed an efficient method for encoding integers as self-inverting per- 
mutations, we next describe an algorithm for encoding a self-inverting permuta- 
tion TT* into a directed graph F[7r*]. We also describe a decoding algorithm for 
extracting the permutation tt* from the graph i^[7r*]. 
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4.1 Algorithm Encode_SIP-to-RPG 

We next propose the algorithm Encode_SIP-to-RPG which takes as input the 
self-inverting permutation n* produced by the algorithm Encode_W-to-SIP and 

constructs a reducible permutation flow-graph -F[7r*] by using an efficient DAG 
representation of the self-inverting permutation n* . The whole encoding process 
takes 0{n) time and requires 0{n) space, where n is the length of the input 
permutation tt* . 

Given a self-inverting permutation tt* of length n our decoding algorithm 
works on two phases: 

I. it first uses a strategy to transform the permutation tt* into a directed acyclic 
graph -D[7r*] using certain combinatorial properties of the elements of tt*; 

II. then, it constructs a directed graph F[tt*] on n-|-2 nodes using the adjacency 
relation of the nodes of the dag -D[7r*]. 

Next, we first describe the main ideas behind the two phases and then we 
present in details the whole algorithm. 

Construction of the DAG D[n*] from the permutation tt*: We construct 
the directed acyclic graph -D[7r*] by exploiting the didomination relation of the 
elements of tt*, as follows: 

(i) for every element i of tt*, create a vertex Vi and add it to the vertex set 

ViD[n*]y, 

(ii) compute the didomination relation of each element i of it*; recall that the 

didomination set didom{i) of the element i contains all the elements j of tt* 
that are didominated by the element i (see Definition 3); 

(iii) for every pair of vertices {vi,Vj) of the set V'(Z)[7r*]) do the following: add 
the edge {vi,Vj) in E{D['!^*\) if the element i didominates the element j in 

IT ; 

(iv) create two dummy vertices s and t and add both in y(£'[7r*]); then, add the 
edge (s, Vi) in E{D[-k*\), for every Vi with indeg{vi) = 0, and the edge {vi,t) 
in E{D[Tr*]), for every vt with outdegivi) = 0. 

Construction of the RPG F[7r*] from the graph D[7r*]: We construct the 
directed graph F[k*] by exploiting the adjacency relation of the nodes of the 
dag -D[7r*], as follows: 

(i) for every vertex Vi of D[7r*], 1 < i < n, create a node w, and add it to 

\^(F[7r*]); create the nodes Un+i and uq and add them to y(i^[7r*]); note 
that, the nodes and uq correspond to s and t, respectively; 

(ii) for every pair of nodes {ui,Ui-i) of the set y(F[7r*]) add the directed edge 
{ui,Ui-i) in E{F[Tr*]), 1 < i < n + 1; 

(iii) add the directed edge {ui,Uj) in ^^(^[Tr*]) if (v^.Vj) e E{D[Tr*]), 1 <i <n + 
1, and Vi is the maximum-labeled element of the set {vjj , , . . . , Vi.^^^^^.^ }, 
where {vi^,Vj) e E{D[Tr*]), 1 <k < indeg{i). 
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Algorithm Encode_SIP-to-RPG 

1. Construct a directed acyclic graph (dag) D[-k*] on n vertices as follows: 

o V{D[ti*]) = {ui.'-,'2....,i.J: 

o compute the set didom{i) of each element iiuTT*, l<i<n— 1; 
o for each j e didom{i), add the edge {vi,Vj) in E{D[Tr*]); 
o add two dummy vertices s — w„+i and t ~ vq in V{D[Tr*]); 
o add {s,Vi) G E{D[Tr*]), for every Vi with indeg{vi) — 0; 
o add {vi,t) e E{D[Tr*]), for every Vi with outdeg{vi) = 0; 

2. For each vertex Vi G V{D[tt*]), 1 < i < n, do 

o compute the set P{v,) = {v^ G V{D[n*])\{vj..Vi) G E{D[k*])}; 
o select the maximum-labeled vertex Vm from P{vi); 
o set p{vi) = Vm; 

3. Construct a directed graph F[Tr*] on n + 2 vertices, as follows: 

o V{F[tt*]) = {t = Uo,Ui,..., Un, Un+i = s}; 
o for i = n downto do 

add the edge {ui^i,Ui) in E{F[tt*]); we call it list pointer, 

4. For each vertex Uj G V{F[tt*]), 1 < i < n, do 

o add the edge {ui,Um) in E{F[tt*]) if Vm = p{vi)', 
we call it max-didomitation pointer, 

5. Return the graph -F[7r*]; 

Time and Space Complexity. The most time- and space-consuming steps of the 
algorithm are the construction of the directed graph D[Tr*] (Step 1) and the 
computation of the function p for each vertex Vi G y(iD[7r*]), 1 <i <n (Step 2; 
recall that p{vi) eqiials the maximum-labeled vertex of the set P{vi) contain- 
ing all the vertices of -D[7r*] which didominate vertex Vi). On the other hand, 
the construction of the reducible permutation flow-graph Fin*] (Steps 3 and 4) 
requires only the list pointers, which can be trivially computed, and the max- 
didomitation pointers, which can be computed using the function p. 

Looking at the permutation tt*, we observe that the element m which cor- 
responds to vertex of -D[7r*] is the max-indexed element on the left of the 
element i in tt* that is greater than i. Thus, the function p can be alternatively 
computed using the input permutation as follows: 

(i) insert the element s with value n + 1 into a stack S; 
top.S is the element on the top of the stack; 

(ii) for each element tt^ G tt*, i = 1, 2, . . . , n, do the following: 

while top-S < TTi do 

remove the topS from S; 

p{ui) = top^S; 
insert tt^ in stack S; 

Since each element of the input permutation tt* is inserted once in the stack 
S and is compered once with each new element the whole computation of the 
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TT* = (6,3,2,9,8,1,11,5,4, 10,7) 




Fig. 2. The main structures used or constructed by the algorithms Encode_SIP-to-RPG 
and Decode_RPG-to-SIP; that is, the self-inverting permutation tt', the dag Din*], the 
reducible graph F[-k*], and the tree T[7r*] 



function p takes 0{n) time and space, where n is the length of the permutation 
TT. Thus, we obtain the following result: 

Theorem 3. Let tt* be a self-inverting permutation of length n. The algorithm 
Encode_SIP-to-RPG for encoding the permutation tt* as a reducible permutation 
flow-graph -F'[7r*] requires 0{n) time and space. 



4.2 Algorithm Decode_RPG-to-SIP 

The algorithm Encode_SIP-to-RPG produces reducible permutation flow-graph 
F[7r*] in which it encodes a self-inverting permutation tt* . Thus, we are interested 
in designing an efficient and easily implemented algorithm for extracting the 
permutation tt* from the graph i^[7r*]. 

Next, we present such a decoding algorithm, we call it Decode_RPG-to-SIP, 
which is efficient: it takes time and space linear, i.e., 0{n), in the size of the flow- 
graph -F[7r*], and easily implemented: the only operations used by the algorithm 
are edge modifications on F[7r*] and DFS-search on trees. 
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The algorithm takes as input a reducible permutation flow-graph -FfTr*] on 
71 + 2 nodes and produces a self-inverting permutation tt* of length n; it works 
as follows: 

Algorithm Decode_RPG-to-SIP 

1. Delete the directed edges {vi+i,Vi) from the edge set E{F[tt*]), I < i < n, 
and the node t = vq from l/(F[7r*]); 

2. Flip all the remaining directed edges of the graph -F[7r*]; let T[tt*] be the 
resulting tree and let s — vo,vi,V2, ■ ■ ■ , Vn be the nodes of r[7r*]; 

3. Perform DFS-search on tree T[tt*] starting at node s by always proceeding 
to the minimum-labeled child node and compute the DFS discovery time 
d[v] of each node v of T[7r*]; 

4. Order the nodes s — vq,vi,V2, ■ ■ ■ ,Vn of the tree r[7r*] by their DFS discovery 
time c?[] and let tt = (wq, v'l, v'2, ■ ■ ■ , v'^^) be the resulting order, where d[v'j] < 
d[Vj] for i < j , < i, j < n; 

5. Delete node s from the order tt; 

6. Return tt* = tt; 

Time and Space Complexity. The size of the reducible permutation graph i^[7r*] 
constructed by the algorithm Encode_SIP-to-RPG is 0{n), where n is the length 
of the permutation tt*, and thus the size of the resulting tree T[k*] is also 0{n). 
It is well known that the DFS-search on the tree T[Tr*] takes time linear in the 
size of r[7r*]. Thus, the decoding algorithm is executed in 0{n) time using 0{n) 
space. Thus, the following theorem holds: 

Theorem 4. Let F[tt*] be a reducible permutation flow-graph of size 0{n) pro- 
duced by the algorithm Encode_SIP-to-RPG. The algorithm Decode_RPG-to-SIP 
decodes the flow- graph F[Tr*] in 0{n) time and space. 

5 Properties and Attacks 

In this section, we analyze the structures of the two main components of our 
proposed codec, that is, the self- inverting permutation tt* produced by the al- 
gorithm Encode_W-to-SIP and the reducible permutation graph -F[7r*] produced 
by the algorithm Encode_SIP-to-RPG, and discuss their properties with respect 
to resilience to attacks. 

5.1 Properties of permutation tt* 

CoUberg et al. |9I7) describe several techniques for encoding watermark integers 
in graph structures. Based on the fact that there is a one-to-one correspondence 
between self-inverting permutations and isomorphism classes of RPGs, Collberg 
et al. [7] proposed a polynomial algorithm for encoding any integer w as the RPG 
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corresponding to the wth self-inverting permutation tt in this correspondence. 
This cnicoding exploits only the fact that a self-inverting permutation is its own 
inverse; it does not incorporate any other structural property. 

In our codec system proposed in this paper an integer w is encoded as self- 
inverting permutation tt* using a particular construction technique which cap- 
tures into TT* important structural properties. These properties unable us to 
identify any single change (in some cases, multiple changes) made by an attacker 

to TT*. 

The main structural properties of our self-inverting permutation tt* produced 
by the algorithm Encode_W-to-SIP can be summarized into the following three 
categories: 

• Length property: By construction the self-inverting permutation tt* has 

always odd length. Thus, any single node-modification, i.e., adding an ele- 
ment in TT* or deleting an element from tt*, can be easily identified; 

• Bitonic property: Algorithm Decode_SIP-to-W decodes the self-inverting 
permutation tt* to obtain the encoded integer w. During the decoding pro- 
cess two sequences are constructed, that is, the increasing subsequence X 
and the decreasing subsequence Y (see Step 4) , which incorporate the bitonic 
propcirty of the encoding process. If the permutation tt* has not been pro- 
duced by our encoding algorithm Encode_W-to-SIP then subsequence Y may 
not be increasing. Thus, an appropriate change to SIP n* that keeps the SIP 
property may be identified by checking the subsequence Y; 

• Block property: The algorithm Encode_W-to-SIP takes the binary repre- 
sentation of the integer w and constructs the number B' (see Step 2). The 
binary representation of B' consists of three parts (or, blocks): (i) the first 
part contains the first n bits with Os values, (ii) the second part contains 
the next n bits which forms the binary representation of the integer w, and 
(iii) the third part of length one contains a bit 1. This property is encapsu- 
lated in the structure of tt* in such a way that during the decoding process 
the binary sequence B' constructed in Step 6 of the decoding algorithm 
Decode_SIP-to-W is identical to the sequence B' constructed by the encod- 
ing algorithm Encode_W-to-SIP. If an attacker make appropriate changes to 
SIP TT* so that the resulting permutation tt* still has the SIP property, then 
the first block of the binary sequence B' may contain one or more Is or the 
third block may be 0. 

6 Properties of the Flow-graph F[7r*] 

We next describe the main properties of our codec system (encode, decode)^[7r*] j 
we mainly focus on the properties of the reducible permutation graph F[tt*] with 
respect to graph-based software watermarking attacks. 

Components of the graph F[tt*]: The reducible permutation graph F[tt*] consists 
of the following three components: 
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(1) A header node: it is a root node with outdegree one from which every other 
node in the graph F[7r*] is reachable. Note that, every control flow-graph has 
such a node. In the graph F[tt*] the header node is denoted by s: 

(2) A footer node: it is a node with outdegree zero that is reachable from 
every other node of the graph. Every control flow-graph has such a node, 
representing the method exit. In the graph F[7r*] the footer node is denoted 

hyt; 

(3) The body: it consists of n nodes Ui, U2, . . . , w„ each with outdegree two. 
In particular, each node ?i, (1 < « < n) has exactly two outpointers: one 
points to node Uj-i, which we call list pointer, and the other points to 
node Um, which we call max-didomination pointer, where m> i; note that, 

Um> Ui> Ui-i. 

Structural Properties: In graph-based encoding algorithms, the watermark w is 
encoded into some special kind of graphs G. Generally, the watermark graph G 
should not differ from the graph data structures built by real programs. Impor- 
tant conditions are that the maximum outdegree of G should not exceed two 
or three, and that the graph G have a unique root node so the program can 
reach other nodes from the root node. Moreover, G should be resilient to at- 
tacks against edge and/or node modiflcations. Finally, G should be efficiently 
constructed. 

The proposed reducible permutation graph F[Tr*] and a corresponding codec 
(encode, decode)ir[^.] have all the above properties; in particular, the graph 
F[Tr*] and the corresponding codec have the following properties: 

• Appropriate graph types: The graph Flir*] is directed on n + 2 nodes 
with outdegree exactly two; that is, it has low max-outdegree, and, thus, it 
matches real program graphs; 

• High resiliency: Since each node in the reducible permutation graph Flir*] 
has exactly one list outpointer and exactly one max-didom outpointer, any 
single edge modification, i.e., edge-flip, edge-addition, or edge-deletion, will 
violate the outpointer condition of some nodes, and thus the modified edge 
can be easily identified and corrected. Thus, the graph F[tt*] unable us to 
correct single edge changes; 

• Small size: The size |P^| — |P| of the embedded watermark is small; 

• Efficient codecs: The codec (encode, decode)^[^*] has low time and space 
complexity; indeed, we have showed (see Theorem 4 and Theorem 5) that 
the encoding algorithm Encode_SIP-to-RPG requires 0{n) time and space, 

where n is the size of the input permutation tt* , while the decoding algorithm 
Decode_RPG-to-SIP decodes the flowgraph -F[7r*] in 0{n) time and space. 

It is worth noting that our encoding and decoding algorithms use basic data 
structures and basic operations, and, thus, they can be easily implemented. 

Unique Hamiltonian Path: It is well-known that any acyclic digraph G has at 
most one Hamiltonian path (HP); G has one HP if the subgraphs Gq,G\, . . . ,Gn 
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have only one node with indegree zero, where Gq — G and — G\{ui, U2, ■ • ■ , Wz}, 
1 < i < n — 1 (recah that n denotes the number of nodes in G). Furthermore, 
it has been shown that any reducible flow-graph has at most one Hamiltonian 
path [7]. 

We next show that the reducible permutation graph F[Tr*] produced by the 
algorithm Encode_SIP-to-RPG has always a unique Hamiltonian path, denoted 
by HP(_F[7r*]), and this Hamiltonian path can be found in 0{n) time, where n is 
the number of nodes of -F[7r*]. The following algorithm, which we call Unique_HP, 
takes as input a flow- graph i^[7r*] on n nodes and produces its unique Hamilto- 
nian path HP(F[7r*]). 

Algorithm Unique_HP 

1. Find the node uq of the graph F[n*] with outdegree one; 

2. Perform DFS-search on graph F[it*] starting at node uq and compute the 
DFS discovery time d[u] of each node u of F[Tr*]; 

3. Order the nodes wq, ""i, • • • , "n+i of the graph F[n*] by their DFS discovery 
time d[] and let HP(F[7r*]) = (uq, u'^, . . . , u'^^^i) be the resulting order, where 

< d[u'j] for i<j,0<ij<n + 1; 

4. Return HP(F[7r*]); 

Since the graph -F[7r*] contains n nodes and m = 0{n) edges, both finding 
the node of F[7t*] with outdegree one and performing DFS-search on F[7r*] take 
0{n) time and require 0(n) space. Thus, we have the following result. 

Theorem 5. Let F[k*] be a reducible permutation graph of size 0{n) produced 
by the algorithm Encode_SIP-to-RPG. The algorithm Unique_HP correctly com- 
putes the unique Hamiltonian path of F[Tr*] in 0{n) time and space. 

By construction, our reducible flow-graph F[Tr*] is a labeled graph; graphs 
in which labels (which are most commonly numbers) are assigned to nodes are 
called labeled graphs, while graphs in which individual nodes have no distinct 
identiflcations are called unlabeled graphs (unless indicated otherwise by context, 
the term "labeled" graph generally refers to a node labeled graph). Indeed, the 
labels of F[7r*] are numbers of the set {0, 1, . . . , n -I- 1}, where the label n 4- 1 is 
assigned to header node s = m„+i, the label is assigned to footer node t — uq, 
and the label n — i is assigned to the ith body node Un+i-i, 1 < i < n. 

Let i^'[7r*] be the graph which results after making some label modifications 
on the flow-graph -F[7r*]; a label modiflcation attacker may be performs swap- 
ping of the labels of two nodes, altering the value of the label of a node, or 
even removing all the labels of the graph -F[7r*] resulting an unlabeled graph. 
Since the extraction of the watermark w relies on the labels of the flow-graph 
F[7r*] (see algorithm Decode_RPG-to-SIP), it follows that our codec system 
(encode, decode) is susceptible to node modiflcation attacks. 

Thus, we are interested in flnding a way to extract the watermark w efficiently 
from -F[7r*] without relying on its labels; for example, to extract w efficiently from 
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the graph F'[7r*]. We show that, after any node modification attack on graph 
f [tt*], we can efficiently reassign the initial labels to nodes of F[Tr*] using the 
structure of the unique Hamiltonian path HP(F[7r*]). More precisely, given the 
graph F'[7r*] we can construct the flow-graph F[Tr*] in 0{n) time and space. In 
addition, if i^'[7r*] is the unlabeled graph of the flow-graph -^[Tr*] we can also 
construct the graph F[7r*] in 0{n) time and space. 

The above properties imply that we are able to extract a watermark w in 
linear time from a node modified or unlabeled fiow-graph -F[7r*]; this can be 
simply done by assigning labels to nodes of F[Tr*] just prior the use of the 
decoding algorithm Decode_RPG-to-SIP. Thus, we obtain the following result. 

Lemma 1. Let F[7r*] be a reducible permutation graph of size 0{n) produced 
by the algorithm Encode_SIP-to-RPG and let F'[tt*] be the graph resulting from 
F[7r*] after modifying or deleting the node-labels of F[tt*]. Given F'[tt*], the 
flow-graph F[7r*] can be constructed in 0{n) time and space. 

7 Concluding Remarks 

In this paper we extended the class of error correcting graphs by proposing 
efficient and easy to implement graph encodings. In particular, we proposed an 
eflicient and easily implemented codec system for encoding watermark numbers 
as graph structures. 

More precisely, we first presented the algorithm Encode_W-to-SIP which en- 
codes an integer w as SIP (self-inverting permutation) tt* in 0{n) time and space, 
where n is the number of bits in the binary representation of w, and the cor- 
responding decoding algorithm Decode_SIP-to-W which extracts the watermark 
number w from the SIP n* also in 0{n) time and space. 

We next presented the algorithm Encode_SIP-to-RPG which encodes the SIP 
IT* as a reducible fiow-graph F[jt*] in 0{n) time and space by exploiting didomi- 
nation relations on the elements of tt*, and the corresponding decoding algorithm 
Decode_RPG-to-SIP which extracts the SIP tt* from the graph F[Tr*] in 0{n) 
time and space by converting first the graph F[tt*] into a directed tree T[7r*] and 
then applying DFS-search on r[7r*]. 

The main features of our proposed encoding and decoding algorithms can be 
summarized as follows: 

• Algorithms Encode_W-to-SIP and Decode_SIP-to-W: use basic data struc- 
tures; apply elementary operations on sequences; have low time and space 

complexity; have an easy implementation; 

• Algorithms Encode_SIP-to-RPG and Decode_RPG-to-SIP: use domination 
relations on permutations; construct dags and lists; use DFS-search on di- 
rected trees; have low time and space complexity; have an easy implementa- 
tion; 

An interesting property of our encoding approach is that of enabling us to encode 
the integer w = 6162 • ■ • 6n as self-inverting permutation tt* of any length; indeed, 
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TT* can be constructed over the set Nn' = {1,2,. ..,n'}, where the smallest value 
of n' is O(logn). 

It is worth noting that the two main components of our proposed codec 
system, i.e., the self-inverting permutation tt* and the reducible permutation 
graph -^[Tr*], incorporate important structural properties, due to the bitonic 
property encapsulated in tt* and the reducible property of -F[7r*], which make 
our codec system resilient to attacks. In particular, these properties unable us to 
identify any single change (in some cases, multiple changes) made by an attacker 
to TT* and F[tt*]. 

Thus, in light of our two codec components tt* and i^[7r*] proposed in this 
paper it would be very interesting to come up with new efficient codec algorithms 
and structures having "better" properties with respect to resilience to attacks; 
we leave it as an open question. An interesting question with practical value is 
whether the class of reducible permutation graphs can be extended so that it 
includes other classes of graphs with structural properties capable to efficiently 
encode watermark numbers. 

Another interesting question with practical value is whether we can produce 
more than one reducible flow-graphs i^i [tt*] , F2 [tt*] , . . . , F„ [tt*] which encode the 
same self-inverting permutation tt* ; we also leave it as an open question (see [3] 
for cographs). 

Finally, we leave as an open problem the evaluation of our codec algorithms 
and structures in a simulation environment in order to obtain detailed infor- 
mation about their practical behaviour. For future investigation, we also leave 
as an open problem the analysis of our codec algorithms under other software 
watermarking measurements. 
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